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BLAST 



Basic Local Alignment Search Tool 




Blast 2 sequences 

gb|AAP41037| (1255 letters) 

Results for: 

gb|AAP41 037.1 E2 glycoprotein precursor [Severe acute respiratory syndrome- related coronavirus] >gi|30795145|gb|A...(1255aa) 
Your BLAST job specified more than one input sequence. This box lets you choose which input sequence to show BLAST 
results for. 

Query ID 

gi|307951 45|gb| AAP41 037. 1 j 
Description 

E2 glycoprotein precursor [Severe acute respiratory syndrome-related coronavirus] >gi|30795145|gb|AAP41037.1| 

spike glycoprotein [SARS coronavirus Tor2] 
Molecule type 

amino acid 
Query Length 

1255 

Subject ID 

15461 
Description 

SID 3 of 10/843656 
Molecule type 

amino acid 
Subject Length 

1255 
Program 

BLASTP 2.2.23+ < 

felcrcrr,-, 

Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. 
Lipman (1997), "Gapped BLAST and PSl-BLAST: a new generation of protein database search programs", Nucleic Acids 
Res. 25:3389-3402. 

Stephen F. Altschul, John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and 
Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. 
Other reports: 



Search Parameters 



Search parameter name Search parameter value 



Program blastp 



Word size 3 

Expect value 10 

Hitiistsize 100 

Gapcosts 11,1 



Matrix BLOSUM62 



Filter string F 

Genetic Code 1 

Window Size 40 

Threshold 1 1 

Composition-based stats 2 
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Karlin-Altschul statistics 



Params Ungapped Gapped 



Lambda 0.321182 0.267 
K 0.135961 0.041 

H 0.41432 0.14 



Results Statistics 



Results Statistics parameter name Results Statistics parameter value 

Effective search space 1 456849 



Distribution of 1 Blast Hits on the Query Sequence 



I2L 



An overview of the database sequences aligned to the query sequence is shown. The score of each alignment is indicated by 
one of five different colors, which divides the range of scores into five groups. Multiple alignments on the same database 
sequence are connected by a striped line. Mousing over a hit sequence causes the definition and score to be shown in the 
window at the top, clicking on a hit sequence takes the user to the associated alignments. New: This graphic is an overview 
of database sequences aligned to the query sequence. Alignments are color-coded by score, within one of five score ranges. 
Multiple alignments on the same database sequence are connected by a dashed line. Mousing over an alignment shows the 
alignment definition and score in the box at the top. Clicking an alignment displays the alignment detail. 



Query 
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Plot of gi|30795145|gb|AAP41037.1| vs 15461 [?] 

This dot matrix view shows regions of similarity based upon the BLAST results. The query sequence is represented on the 
X-axis and the numbers represent the bases/residues of the query. The subject is represented on the Y-axis and again the 
numbers represent the bases/residues of the subject. Alignments are shown in the plot as lines. Plus strand and protein 
matches are slanted from the bottom left to the upper right corner, minus strand matches are slanted from the upper left to 
the lower right. The number of lines shown in the plot is the same as the number of alignments found by BLAST. 

Description 

Legend for links to other resources: UniGene GEO B Gene Structure E3 Map Viewer 
Sequences producing significant alignments: 

Accession Description ktax store TctaUoor Query • Links 

15461 SID 3 of 10/843656 : 2600 100% 0.0 
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MFIFLLFLTLTSGSDLDRCTTFDDVQAPNYTQHTSSMRGVYYPDEIFRSDTLYLTQDLFL 60 
MFIFLLFLTLTSGSDLDRCTTFDDVQAPNYTQHTSSMRGVYYPDEIFRSDTLYLTQDLFL 

MFI FLLFLTLTSGSDLDRCTT FDDVQAPNYTQHTSSMRGVYYPDEI FRS DTLYLTQDLFL 6 0 

PFYSNVTGFHTINHTFGNPVIPFKDGIYFAATEKSNWRGWVFGSTMNNKSQSVIIINNS 120 
PFYSNVTGFHTINHTFGNPVIPFKDGIYFAATEKSNWRGWVFGSTMNNKSQSVII INNS 

PFYSNVTGFHTINHTFGNPVIPFKDGIYFAATEKSNWRGWVFGSTMNNKSQSVIIINNS 120 

TNWIRACNFELCDNPFFAVSKPMGTQTHTMIFDNAFNCT FEYISDAFSLDVSEKSGNFK 180 
TNWIRACNFELCDNPFFAVSKPMGTQTHTMIFDNAFNCTFEYISDAFSLDVSEKSGNFK 

TNWIRACNFELCDNPFFAVSKPMGTQTHTMIFDNAFNCT FEYISDAFSLDVSEKSGNFK 180 

HLREFVFKNKDGFLYVYKGYQPIDWRDLPSGFNTLKPIFKLPLGINITNFRAILTAFSP 240 
HLREFVFKNKDGFLYVYKGYQPIDWRDLPSGFNTLKPIFKLPLGINITNFRAILTAFSP 

HLREFVFKNKDGFLYVYKGYQPIDWRDLPSGFNTLKPIFKLPLGINITNFRAILTAFSP 240 

AQDIWGTSAAAYFVGYLKPTTFMLKYDENGTITDAVDCSQNPLAELKCSVKSFEIDKGIY 300 
AQDIWGTSAAAYFVGYLKPTTFMLKYDENGTITDAVDCSQNPLAELKCSVKSFEIDKGIY 

AQDIWGTSAAAYFVGYLKPTTFMLKYDENGTITDAVDCSQNPLAELKCSVKS FEIDKGIY 300 

QTSNFRWPSGDWRFPNITNLCPFGEVFNATKFPSVYAWERKKISNCVADYSVLYNSTF 360 
QTSNFRWPSGDWRFPNITNLCPFGEVFNATKFPSVYAWERKKISNCVADYSVLYNSTF 

QTSNFRWPSGDWRFPNITNLCPFGEVFNATKFPSVYAWERKKISNCVADYSVLYNSTF 360 

FSTFKCYGVSATKLNDLCFSNVYADSFVVKGDDVRQIAPGQTGVIADYNYKLPDDFMGCV 420 
FSTFKCYGVSATKLNDLCFSNVYADSFVVKGDDVRQIAPGQTGVIADYNYKLPDDFMGCV 

FSTFKCYGVSATKLNDLCFSNVYADSFWKGDDVRQIAPGQTGVIADYNYKLPDDFMGCV 420 

LAWNTRNIDATSTGNYNYKYRYLRHGKLRPFERDISNVPFSPDGKPCTPPALNCYWPLND 4 80 
LAWNTRNIDATSTGNYNYKYRYLRHGKLRPFERDISNVPFSPDGKPCTPPALNCYWPLND 

LAWNTRNIDATSTGNYNYKYRYLRHGKLRPFERDISNVPFSPDGKPCTPPALNCYWPLND 480 



YGFYTTTGIGYQPYRVWLSFELLNAPATVCGPKLSTDLIKNQCVNFNFNGLTGTGVLTP 
YGFYTTTGIGYQPYRVWLSFELLNAPATVCGPKLSTDLIKNQCVNFNFNGLTGTGVLTP 
YGFYTTTGIGYQPYRVWLSFELLNAPATVCGPKLSTDLIKNQCVNFNFNGLTGTGVLTP 



SSKRFQPFQQFGRDVSDFTDSVRDPKTSEILDISPC 
SSKRFQPFQQFGRDVS DFTDSVRDPKTSE I LD I S PC 
SSKRFQPFQQFGRDVSDFTDSVRDPKTSEILDISPC 



540 
540 



'GGVS VI T PGTNAS SEVAVL YQD 
'GGVS VI T PGTNAS SEVAVLYQD 
"GGVSVITPGTNASSEVAVLYQD 



VNCTDVSTAIHADQLTPAWRIYSTGNNVFQTQAGCLIGAEHVDTSYECDIPIGAGICASY 660 
VNCTDVSTAIHADQLTPAWRIYSTGNNVFQTQAGCLIGAEHVDTSYECDIPIGAGICASY 

VNCTDVSTAIHADQLTPAWRIYSTGNNVFQTQAGCLIGAEHVDTSYECDIPIGAGICASY 660 

HTVSLLRSTSQKSIVAYTMSLGADSS IAYSNNTIAI PTNFSISITTEVMPVSMAKTSVDC 720 
HTVSLLRSTSQKS IVAYTMSLGADSS IAYSNNT IAI PTNFSISITTEVMPVSMAKTSVDC 

HTVSLLRSTSQKSIVAYTMSLGADSS IAYSNNTIAI PTNFSISITTEVMPVSMAKTSVDC 720 

NMYICGDSTECANLLLQYGSFCTQLNRALSGIAAEQDRNTREVFAQVKQMYKTPTLKYFG 780 
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GFNFSQILPDPLKPTKRSFIEDLLFNKVTLADAGFMKQYGECLGDINARDLICAQKFNGL 840 
GFNFSQILPDPLKPTKRSFIEDLLFNKVTLADAGFMKQYGECLGDINARDLICAQKFNGL 
GFNFSQILPDPLKPTKRSFIEDLLFNKVTLADAGFMKQYGECLGDINARDLICAQKFNGL 840 

TVLPPLLTDDMIAAYTAALVSGTATAGWT FGAGAALQIPFAMQMAYRFNGIGVTQNVLYE 900 
TVLPPLLTDDMIAAYTAALVSGTATAGWTFGAGAALQIPFAMQMAYRFNGIGVTQNVLYE 
TVLPPLLTDDMIAAYTAALVSGTATAGWTFGAGAALQI PFAMQMAYRFNGIGVTQNVLYE 900 

NQKQIANQFNKAISQIQESLTTTSTALGKLQDWNQNAQALNTLVKQLSSNFGAISSVLN 960 
NQKQIANQFNKAISQIQESLTTTSTALGKLQDWNQNAQALNTLVKQLSSNFGAISSVLN 
NQKQIANQFNKAISQIQESLTTTSTALGKLQDWNQNAQALNTLVKQLSSNFGAISSVLN 960 

DILSRLDKVEAEVQIDRLITGRLQSLQTYVTQQLIRAAEIRASANLAATKMSECVLGQSK 1020 
DILSRLDKVEAEVQIDRLITGRLQSLQTYVTQQLIRAAEIRASANLAATKMSECVLGQSK 
DILSRLDKVEAEVQIDRLITGRLQSLQTYVTQQLIRAAEIRASANLAATKMSECVLGQSK 1020 

RVDFCGKGYHLMSFPQAAPHGWFLHVTYVPSQERNFTTAPAICHEGKAYFPREGVFVFN 1080 
RVDFCGKGYHLMSFPQAAPHGWFLHVTYVPSQERNFTTAPAICHEGKAYFPREGVFVFN 
RVDFCGKGYHLMSFPQAAPHGVVFLHVTYVPSQERN FTTAPAICHEGKAYFPREGVFVFN 1080 

GTSWFITQRNFFSPQI ITTDNTFVSGNCDVVIGI INNTVYDPLQPELDSFKEELDKYFKN 1140 
GTSWFITQRNFFSPQIITTDNTFVSGNCDVVIGI INNTVYDPLQPELDSFKEELDKYFKN 
GTSWFITQRNFFSPQI ITTDNTFVSGNCDVVIGI INNTVYDPLQPELDSFKEELDKYFKN 114 0 

HTSPDVDLGDISGINASWNIQKEIDRLNEVAKNLNESLIDLQELGKYEQYIKWPWYVWL 1200 
HTSPDVDLGDISGINASWNIQKEIDRLNEVAKNLNESLIDLQELGKYEQYIKWPWYVWL 
HTSPDVDLGDISGINASWNIQKEIDRLNEVAKNLNESLIDLQELGKYEQYIKWPWYVWL 1200 

GFIAGLIAIVMVTILLCCMTSCCSCLKGACSCGSCCKFDEDDSEPVLKGVKLHYT 1255 
GFIAGLIAIVMVTILLCCMTSCCSCLKGACSCGSCCKFDEDDSEPVLKGVKLHYT 
GFIAGLIAIVMVTILLCCMTSCCSCLKGACSCGSCCKFDEDDSEPVLKGVKLHYT 1255 
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